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Abstract. In this paper, we provide a new characterization of uniformly recurrent words with 
finite defect based on a relation between the palindromic and factor complexity. Furthermore, 
we introduce a class of morphisms P ret closed under composition and we show that a uniformly 
■ recurrent word with finite defect is an image of a rich (also called full) word under a morphism 

of class Pret- This class is closely related to the well-known class P defined by Hof, Knill, and 
Simon; every morphism from P ret is conjugate to a morphism of class P. 
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1. Introduction 



The upper bound \w\ + 1 on the number of palindromes occurring in a finite word w given 
by X. Droubay, J. Justin, and G. Pirillo in [10] initiated many interesting investigations on 
■ palindromes in infinite words as well. An infinite word for which the upper bound is attained for 

any of its factors is called rich or full. There exist several characterizations of rich words based 
on the notion of complete return words [12], on the longest palindromic suffix and prefix of a 
factor [10, 7], on the palindromic and factor complexity [6] and most recently on the bilateral 
! orders of factors [3] . Brlek et al. suggested in [5] to study the defect of a finite word w defined as 

' the difference between the upper bound |«?| + 1 and the actual number of palindromes contained 

in w. The defect of an infinite word is then defined as the maximal defect of a factor of the 
infinite word. In this convention, rich words are precisely the words with zero defect. In this 
I/"") ■ paper we focus on uniformly recurrent words with finite defect. Let us point out that periodic 

q{ . words with finite defect have been already described in [5] and in [12]. In Section 2 we introduce 

notation and summarize known results on rich words and words with finite defect. In Section 3 
the notion of oddities and the characterization of uniformly recurrent words with finite defect 
based on oddities from [12] is recalled and, as an immediate consequence, two more useful 
characterizations are deduced. The main result is a new characterization of uniformly recurrent 
^ . words with finite defect based on a relation between the palindromic and factor complexity, see 

Theorem 4.1 in Section 4. Furthermore, we introduce a class of morphisms P re t closed under 
composition of morphisms and we show that a uniformly recurrent word with finite defect is 
an image of a rich word under a morphism of class P re t, see Theorem 5.5 in Section 5. This 
class is closely related to the well-known class P defined by Hof, Knill, and Simon in [13]; every 
morphism from P re t is conjugate to a morphism of class P. 

2. Preliminaries 

By A we denote a finite set of symbols, usually called letters; the set A is therefore called an 
alphabet. A finite string w = wqWi . . . w n -\ of letters of A is said to be a finite word, its length 
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is denoted by \w\ = n. Finite words over A together with the operation of concatenation and 
the empty word e as the neutral element form a free monoid A*. The map 

W = WqWi . . . W n -i t-> W = W n -\W n -2 ■ ■ ■ Wo 

is a bijection on A*, the word w is called the reversal or the mirror image of w. A word w which 
coincides with its mirror image is a palindrome. 

Under an infinite word we understand an infinite string u = uqu±U2 ... of letters from A. 
A finite word w is a factor of a word v (finite or infinite) if there exist words p and s such that 
v = pws. If p = e, then w is said to be a prefix of v, if s = e, then w is a suffix of v. 

The language £(u) of an infinite word u is the set of all its factors. Factors of u of length n 
form the set denoted by C n (u). Clearly, C(u) = U ne ^C n (u). We say that the language £(u) is 
closed under reversal if £(u) contains with every factor w also its reversal w. 

For any factor w G £(u), there exists an index i such that w; is a prefix of the infinite word 

UiUi + \Ui + 2 Such an index is called an occurrence of w in u. If each factor of u has infinitely 

many occurrences in u, the infinite word u is said to be recurrent. It is easy to see that if the 
language of u is closed under reversal, then u is recurrent (a proof can be found in [12]). For 
a recurrent infinite word u, we may define the notion of a complete return word of any w G C(u). 
It is a factor v G £(u) such that w is a prefix and a suffix of v and w occurs in v exactly twice. 
Under a return word of a factor w is usually meant a word q G £(u) such that gu; is a complete 
return word of w. If any factor u; G £(u) has only finitely many return words, then the infinite 
word u is called uniformly recurrent. If u is a uniformly recurrent word, we can assign to any 
n G N the minimal number R u (n) G N such that we have for any v G £(u) with \v\ > R u (n) 

{w | \w\ = n, w is a factor of v} = C n (u), 

or equivalently, any piece of u which is longer than or equal to R u (n) contains already all 
factors of u of length n. The map n — > R u ( n ) is usually called the recurrence function of u. In 
particular, any fixed point of a primitive morphism is uniformly recurrent, where a morphism ip 
over an alphabet A is primitive if there exists an integer k such that for every a G A the fc-th 
iteration tp k {a) contains all letters of A. 

The factor complexity of an infinite word u is a map C : N i-> N defined by the prescription 
C{n) := #£ n (u). To determine the first difference of the factor complexity, one has to count the 
possible extensions of factors of length n. A right extension of w G £(u) is any letter a G A such 
that wa G C(u). Of course, any factor of u has at least one right extension. A factor w is called 
right special if w has at least two right extensions. Similarly, one can define a left extension and 
a left special factor. We will deal only with recurrent infinite words u. In this case, any factor of 
u has at least one left extension. We say that w is a bispecial factor if it is right and left special. 

In our article we focus on words in some sense opulent in palindromes, therefore we will 
introduce several notions connected with palindromic factors. 

The defect D(w) of a finite word w is the difference between the utmost number of palindromes 
+ 1 and the actual number of palindromes contained in w. Finite words with zero defects - 
called rich words - can be viewed as the most saturated by palindromes. This definition may 
be extended to infinite words as follows. 

Definition 2.1. An infinite word u = U0U1U2 . . . is called rich, if for any index n G N the prefix 
uqU\U2 ■ ■ ■ u n -i of length n contains exactly n + 1 different palindromes. 

Remark 2.2. We keep here the terminology introduced by Glen et al. in [12] in 2007, which 
seems to us to be prevalent nowadays. However, Brlek et al. in [5] baptized such words full 
already in 2004- 
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Let us remark that not only all prefixes of rich words are rich, but also all factors are rich. 
A result from [10] will provide us with a handful tool which helps to evaluate the defect of a 
factor. 

Proposition 2.3 ([10]). A finite or infinite word u is rich if and only if the longest palindromic 
suffix of w occurs exactly once in w for any prefix w of u. 

The longest palindromic suffix of a factor w will occur often in our considerations, therefore 
we will denote it by lps(w). In accordance with the terminology introduced in [10], the factor 
with a unique occurrence in another factor is called unioccurrent. From the proof of the previous 
proposition directly follows the next corollary. 

Corollary 2.4. The defect D(w) of a finite word w is equal to the number of prefixes w' of w, 
for which the longest palindromic suffix of w' is not unioccurrent in w' . 

This corollary implies that D{v) > D(w) whenever w is a factor of v. It enables to give 
a reasonable definition of the defect of an infinite word (see [5]). 

Definition 2.5. The defect of an infinite word u is the number (finite or infinite) 

.D(u) = sup{D(w) | w is a prefix of u} . 

Let us point out several facts concerning defects that are easy to prove: 

(1) If we consider all factors of a finite or an infinite word u, we obtain the same defect, i.e., 

£>(u) = sup{D(w) | w € £(u)} . 

(2) Any infinite word with finite defect contains infinitely many palindromes. 

(3) Infinite words with zero defect correspond exactly to rich words. 

Periodic words with finite defect have been studied in [5] and in [12]. It holds that the defect of 
an infinite periodic word with the minimal period w is finite if and only if w = pq, where both p 
and q are palindromes. In [12] words with finite defect have been baptized almost rich and the 
richness of a word was described using complete return words. 

Proposition 2.6 ([12]). An infinite word u is rich if and only if all complete return words of 
any palindrome are palindromes. 

The authors of [10] who were the first ones to tackle this problem showed that Sturmian and 
episturmian words are rich. In [5], an insight into the richness of periodic words can be found. 

The number of palindromes of a fixed length occurring in an infinite word is measured by 
the so called palindromic complexity V, a map which assigns to any non- negative integer n the 
number 

V(n) := #{w € C n (u) \ w is a palindrome} . 

The palindromic complexity is bounded by the first difference of factor complexity. The following 
proposition is proven in [2] for uniformly recurrent words, however the uniform recurrence is not 
needed in the proofs, thus it holds for any infinite words with language closed under reversal. 

Proposition 2.7 ([2]). Let u be an infinite word with language closed under reversal. Then 
(1) V(n) + V(n + 1) <C(n + l) -C(n) + 2, 

for all n G N. 

It is shown in [6] that this bound can be used for the characterization of rich words as well. 
The following proposition states this fact. 
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Proposition 2.8 ([6]). An infinite word u with language closed under reversal is rich if and 
only if the equality in (1) holds for all n G N. 

The most recent characterization of rich words given in [3] exploits the notion of the bilateral 
order b(w) of a factor and the palindromic extension of a factor. The bilateral order was 
introduced in [8] as b(u>) = #{awb \ awb G £(u),a,b G A} — if {aw \ aw G C(u),a G A} — ff{wb \ 
wb £ £(u),b £ A} + 1 . The set of palindromic extensions of a palindrome w G £(u) is defined 
by Pext(u>) = {awa \ awa G £(u),a G A}. 

Proposition 2.9 ([3]). An infinite word u with language closed under reversal is rich if and 
only if any bispecial factor w satisfies: 

• if w is non-palindromic, then h(w) = 0, 

• if w is a palindrome, then b(w) = #Pext(w) — 1. 

3. Characterizations of words with finite defect 

Uniformly recurrent words with finite defect are characterized using the notion of oddities in 
Proposition 4.8 from [12]. It is based on the following lower bound. 

Proposition 3.1 (Proposition 4.6 [12]). For any infinite word u it holds 

D(u) > v} | v ^ v and vorvisa complete return word in u of a palindrome w}. 

The set {v,v} is called an oddity. It is clear that for uniformly recurrent words with a finite 
number of distinct palindromes, the defect is infinite, however the number of oddities is finite. 
Moreover, even for uniformly recurrent words with infinitely many palindromes, it can hold 

-D(u) > #{{v,v} | v v and v or v is a complete return word in u of a palindrome w}. 

We take an example for this situation from [12]. 

Example 3.2. Let u = (abcabcacbacb)^ , where u denotes an infinite repetition, then D(u) = 4 ; 
but the number of oddities is equal to 3. 

However, the defect of an aperiodic word can also exceed the number of oddities. For instance, 
if we replace in Example 3.5 the substitution a with — > cabcabcbacbac, 1 — > d, then it is easy 
to show that D(u) = 4, but the number of oddities is 3. 

We can now recall the characterization of words with finite defect based on oddities. 

Proposition 3.3 (Proposition 4.8 [12]). A uniformly recurrent word u has infinitely many 
oddities if and only if u contains infinitely many palindromes and D(u) = oo. 

As an immediate consequence of Proposition 3.3, we obtain the following characterizations of 
infinite words with finite defect. 

Theorem 3.4. Let u be a uniformly recurrent word containing infinitely many palindromes. 
Then the following statements are equivalent: 

1. D(u) < oo, 

2. u has a finite number of oddities, 

3. there exists an integer K such that all complete return words of any palindrome from 
C(u) of length at least K are palindromes, 

4- there exists an integer H such that for any prefix f of u with \f\ > H the longest 
palindromic suffix of f is unioccurrent in f . 
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Proof. 1. and 2. are equivalent by Proposition 3.3. It follows directly from the definition of 
oddities that 2. and 3. are equivalent. Corollary 2.4 implies that 1. and 4- are equivalent. 

□ 

It is easy to see that the last statement of Theorem 3.4 can be equivalently rewritten as: 
There exists an integer H such that for any factor / of u with |/| > H the longest palindromic 
suffix of / is unioccurrent in /. 

Let us stress that if we put in the previous theorem D(u) = K = H = 0, the points 1., 3., 
and 4- become known results on rich words, see Propositions 2.6 and 2.3. 

Example 3.5. Let us provide an example of a uniformly recurrent word u with finite defect and 
let us find for u the lowest values of constants K and H from Theorem 3.4- Take the Fibonacci 
word v, i.e., the fixed point of ip : — > 01, 1 — > 0. Define u as its morphic image cr(v), where 
a : — >■ cabcbac, 1 — >■ d. 

It is easy to show that all palindromes of length greater than 1 and the palindromes a, b, and 
d have only palindromic complete return words. Hint: long palindromes in u contains in their 
center images of non-empty palindromes from v that have palindromic complete return words by 
the richness o/v. The only non-palindromic complete return of c is cabc, thus there is exactly 
one oddity {cabc,cbac}. In order to show that D(u) = 1, it suffices to verify that no prefixes 
longer than cabc have c as their longest palindromic suffix. This follows directly from the form 
of a. The lowest values of the constants K and H are: K = 2, H = 5. 

4. Palindromic complexity of words with finite defect 

The aim of this section is to prove the following new characterization of infinite words with 
finite defect based on a relation between the palindromic and factor complexity. 

Theorem 4.1. Let u be a uniformly recurrent word. Then D(u) < oo if and only if there exists 
an integer N such that 

V{n) + V{n + 1) = C(n + 1) - C(n) + 2 

holds for all n > N . 

Notice that if we set N = in the previous theorem, then we obtain the known characterization 
of rich words from Proposition 2.8 (which holds even under a weaker assumption that C(u) is 
closed under reversal). 

In the sequel, we will prove two propositions that together with the equivalent characteri- 
zations of words with finite defect from Theorem 3.4 imply Theorem 4.1. As we have already 
mentioned, all words with language closed under reversal satisfy the inequality in Proposi- 
tion 2.7. A direct consequence of its proof given in [2] is a necessary and sufficient condition 
for the equality in (1). To formulate this condition in Lemma 4.2, we introduce two auxiliary 
notions. 

Let u be an infinite word with language closed under reversal and let n be a given positive 
integer. 

An n-simple path e is a factor of u of length at least n + 1 such that the only special (right 
or left) factors of length n occurring in e are its prefix and suffix of length n. If w is the prefix 
of e of length n and v is the suffix of e of length n, we say that the n-simple path e starts in w 
and ends in v. 

We will denote by G n an undirected graph whose set of vertices is formed by unordered pairs 
(w,w) such that w G C n {u) is right or left special. We connect two vertices (w,w) and (v, v) by 
an unordered pair (e, e) if e or e is an n-simple path starting in w or w and ending in v or v. 
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Note that the graph G n may have multiple edges and loops. 

Lemma 4.2. Let u be an infinite word with language closed under reversal. The equality in (1) 
holds for an integer n G N if and only if both of the following conditions are met: 

1. The graph G n after removing loops is a tree. 

2. Any n-simple path forming a loop in the graph G n is a palindrome. 

Proposition 4.3. Let u be an infinite word with language closed under reversal. Suppose that 
there exists an integer N such that for all n > N the equality V(n) + V(n + 1) = C{n + 1) — 
C{n) + 2 holds. Then the complete return words of any palindromic factor of length n > N are 
palindromes. 

Proof. Assume the contrary: Let p = p\P2 ■ ■ ■ Pk be a palindrome with k > N and let v be its 
complete return word which is not a palindrome. Clearly \v\ > 2\p\. Then there exist a factor / 
(possibly empty) and two different letters x and y such that v = pfxv'yfp. 

Let us consider the graph G n , where n is the length of the factor w := pf, i.e., n> N. Since 
the language of u is closed under reversal, the factor w is right special - the letters x and y 
belong to its right extensions. 

If the complete return word v contains no other right or left special factors, then the non- 
palindromic v is an n-simple path which starts in w = pf and ends in w = fp - a contradiction 
with the condition 2. in Lemma 4.2. 

Let v contain other left or right special factors of length n. We find the prefix of v which is 
an n-simple path. This simple path starts in w, its ending point is a special factor, we denote 
it by A. Since v is a complete return word of p, we have A ^ w,w. So in the graph G n , the 
vertices (w, w) and (A, A) are connected with an edge. Similarly, we find the suffix of v which is 
an n-simple path and we denote its starting point by B, its ending point is w. Again, B ^ w,w 
and the vertices (w,w) and (B,B) are connected with an edge. So in G n we have a path with 
two edges which connects (A, A) and (B,B) and the vertex (w,w) is its intermediate vertex. 

The special factors A and B are factors of p2 ■ ■ -Pkf xv 'yfPk ■ ■ -P2, it means that in the graph 
G n there exists a walk, and therefore a path 1 as well, between the vertices (A, A) and (B,B) 
which does not use the vertex (w,w). 

Finally, if (A, A) and (B, B) coincide, then we have in G n a multiple edge between (A, A) 
and (w,w). If (A, A) / (B,B), then in G n we have two different paths connecting (A, A) and 
(B,B). Together, G n is not a tree after removing loops - a contradiction with the condition 1. 
in Lemma 4.2. 

□ 

Lemma 4.4. Let u be an infinite word whose language is closed under reversal. Let u have the 
following property: there exists an integer H such that for any factor f £ £(u) with \ f\ > H 
the longest palindromic suffix of f is unioccurrent in f. Let w be a non-palindromic factor of u 
with \w\ > H and v be a palindromic factor of u with \v\ > H . Then 

• occurrences of w andw in u alternate, i.e., any complete return word of w contains the 
factor w, 

• any factor e of u with a prefix w and a suffix w, which has no other occurrences of w 
and W, is a palindrome, 

• any complete return word of v is a palindrome. 



Along a walk vertices may occur with repetition, in a path any vertex appears at most once. 
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Proof. Consider a non-palindromic factor w such that |it>| > H. Let / be a complete return 
word of w. Since \w\ > H, its complete return word satisfies |/| > H. According to the 
assumption, lps(f), the longest palindromic suffix of /, is unioccurrent in /. Its length satisfies 
necessarily \lps(f)\ > \w\ - otherwise a contradiction with the unioccurrence of lps(f). Clearly, 
the palindrome lps(f) has a suffix w and thus a prefix w, i.e., the complete return word / of 
w contains w as well. Moreover, we have proven that any factor e, which has a prefix w and 
a suffix w and which has no other occurrences of w and w, is the longest palindromic suffix of 
a complete return word of w, therefore e = lps(f), i.e., the factor e is a palindrome. 

Consider a palindromic factor v, its complete return word / and the longest palindromic 
suffix of /. Since v is a palindromic suffix of /, necessarily \lps(f)\ > \v\. As \v\ > H, lps(f) is 
unioccurrent in /. Hence, \lps(f)\ > \v\. If lps(f) is shorter than the whole /, then the complete 
return word / contains at least three occurrences of w - a contradiction. Thus, lps(f) = f, i.e., 
/ is a palindrome. 

□ 

Proposition 4.5. Let u be an infinite word whose language is closed under reversal. Let u have 
the following property: there exists an integer H such that for any factor f £ £(u) with |/| > H 
the longest palindromic suffix of f is unioccurrent in f . Then 

2 + C(n + 1) - C(n) = V(n + 1) + V(n) for any n>H. 

Proof. We have to show that both conditions of Lemma 4.2 are satisfied for any n > H. 

The condition 1.: Let (w,w) and (v,v) be two distinct vertices in the graph G n , where n> H. 

We say that an unordered couple (/, /) is a realization of a path between these two vertices if 

• either the factor / or the factor / has the property: w or w is its prefix and v or v is its 
suffix, 

• there exist indices i,£ <G N, i < I such that either the factor / or the factor / coincides 
with the factor UjUj+i . . . Ug and factors w, w, v, and v do not occur in . . . ug-\. 

The number % is called an index of the realization (/, /). 

Since u is recurrent, there exists at least one realization for any pair of vertices (w,w) and 
(v, v) and any realization has infinitely many indices. Consider a realization (/, /) and its index i. 
WLOG / = UiUi + i . . . U£ and w is a prefix of / and v a suffix of /. Since u is recurrent, we can 
find the smallest index m > t such that v! = UiUi+i . . . ug . . . u m has a suffix W. According to 
Lemma 4.4, u' is a palindrome. Therefore its suffix of length |/| is exactly /. This means that 
the index m — \ f\ + 1 is an index of the same realization of a path between (w,w) and (v,v). As 
the factor Uj+i . . . u m -\ does not contain neither the factor w nor w, no index j strictly between 
i and m — \f \ + 1 is an index of any realization of a path between (w,w) and (v,v). 

We have shown that between any pair of two consecutive indices of one specific realization 
(/, /) of a path between (w,w) and (v,v) there does not exist any index of any other realization 
(g,g) of a path between (w,w) and (v,v). This means that there exists a unique realization of 
a path between (w,w) and (v,v), which implies that in the graph G n there exists a unique path 
between vertices (w,w) and (v,v). Since this is true for all pairs of vertices of G n , the graph G n 
after removing loops is a tree. 

The condition 2.: Let w € £(u) be a special factor (palindromic or non-palindromic) with 
\w\ = n > H. An n-simple path / starting in w and ending in w contains according to its 
definition no other special vertex inside the path, in particular w and w do not occur inside the 
path. According to Lemma 4.4, the path / is a palindrome. □ 



8 



BALKOVA, PELANTOVA, AND STAROSTA 



Proof of Theorem 4-1- It is a direct consequence of Propositions 4.3 and 4.5 and of Theorem 3.4, 
where the last statement is replaced with an equivalent one: There exists an integer H such that 
for any factor / of u with |/| > H the longest palindromic suffix of / is unioccurrent in /. □ 

5. MORPHISMS OF CLASS P ret 

In this section, we will define a new class of morphisms and we will reveal their relation 
with well-known morphisms of class P (defined in [13]). We will show an important role these 
morphisms play in the description of words with finite defect. 

Definition 5.1. We say that a morphism (p : B* i— >■ A* is of class P re t if there exists a palindrome 
p £ A* such that 

• <p(b)p is a palindrome for any b G B, 

• ip(b)p contains exactly 2 occurrences of p, one as a prefix and one as a suffix, for any 
beB, 

• ip(b) ^ 99(c) for all b,c G B, b 7^ c. 

Remark 5.2. The following properties of the morphisms of class P re t are easy to prove. 

(1) (p(w) = p(v), where w,v £ B* , implies w = v, i.e., (p is injective, 

(2) (p(x)p = (p(x)p for any x G B* , 

(3) (f(s)p is a palindrome if and only if s G B* is a palindrome. 

Hint for the proof of the injectivity: If ip(w) = (p(v), then (p(w)p = (p(v)p. This implies w = v by 
induction on max{|ti;|, the assertion is true for m&x{\w\, \v\} = 1 (i.e., \w\ = \v\ = 1, since 
the morphism if is not erasing from the second point of Definition 5.1) from the third point of 
Definition 5.1; the induction is then proven using the second point of Definition 5.1. 

Another class of morphisms closely related to defects is standard (special) morphisms of class 
P defined in [12]. We will reveal their connection with P ret in Section 6. 

Proposition 5.3. The class P re t is closed under the composition of morphisms, i.e., for any 
(p,a G P re t we have tpa G P re t (if the composition is well defined). 

Proof. Let p^ and p a be the corresponding palindromes from the definition of P re t of the mor- 
phisms if and a, respectively. Then p^ := <f(p a )Ptp is a palindrome by point (3) of Remark 5.2 
for (p. It suffices to verify that p^ plays the role of the palindrome p for the morphism (pa. 

• Take b a letter. We have ((pa) = (p(a(b)p a )p v . We obtain the following equalities 
using firstly point (2) of Remark 5.2 for ip and then for a: 

V{0-{b)Pa)P V = ^{<y{b)Pa)P V = <P(<r(b)Pa)P<p = ¥{o{ h )Pcr)Pv = (^(bJPpa, 

i.e., ((pa)(b)p ipa is a palindrome for all b. 

• Since (p G P re t, there is a one-to-one correspondence between the occurrences of p vo - = 
(p(Pa)Pifi in (<po~) (b)p va = ^(a^p^p^ and the occurrences of p a in a(b)p a . As a G P re t, 
the word a(b)p a contains p a only as a prefix and as a suffix. Therefore ^(a^p^p^ has 
only two occurrences of (p(pa)Pifi - as a prefix and as a suffix. 

• The injectivity of (p and a clearly guarantees that (<pa)(b) ^ ((pa)(c) for all b / c. 

□ 

In [13] another class of morphisms is defined. We say that a morphism (p is of class P if there 
exist a palindrome p and for every letter a a palindrome q a such that (p(a) = pq a - The interest 
of the class P has been awoken by the following question stated ibidem (however formulated 
in terms of dynamical systems): "Given a fixed point of a primitive morphism <p containing 
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infinitely many palindromes, can we find a primitive morphism a of class P such that the 
factors of a fixed point of a are the same?" Let us recall that for any primitive morphism, 
the languages of all its fixed points are the same. The previous question has been answered 
affirmatively in [14] for morphisms defined on binary alphabets and in [1] for periodic fixed 
points. 

In order to reveal the relation between the classes P and P re t, we have to define the conjugation 
of a morphism. A morphism a is said to be conjugate to a morphism ip defined on an alphabet 
A if there exists a word w G A* such that 

• either for every letter a G A, the image ip(a) has w as its prefix and the image <r{a) is 
obtained from ip(a) by erasing w from the beginning and adding w to the end; we write 
a(a) = w~ 1 ip(a)w, 

• or for every letter a G A, the image ip{a) has w as its suffix and the image o~(a) is 
obtained from (p(a) by erasing w from the end and adding w to the beginning; we write 
a(a) = w<p{a)w~ l . 

Proposition 5.4. If ip is a morphism of class P re t, then ip is conjugate to a morphism of 
class P. 

Proof. Let p G P ret and let p have the same meaning as in the definition of P re t- We will write 
p = qxq, where q 6 A* and x is either the empty word or a letter. Denote by a a morphism 
defined for all letters a as a(a) = q^ 1 p(a)q. Thus, ip is conjugate to a. 

The word q~ 1 p(a)q can be written as xy a since qx is a prefix of p{a)q. Since <p{a)qxq is a 
palindrome, q~ l p(a)qxq q -1 = xy a x is a palindrome too. Therefore y a is a palindrome and a is 
of class P. □ 

The implication cannot be reversed. Consider the alphabet {a, b} and let p(a) = aa and 
ip(b) = ab. It is clear that ip £ P (for p = a), but (p is not conjugate to any morphism of class 
P re t {aaa is not a complete return word of a). 

The following theorem shows the importance of morphisms of class P re t for uniformly recurrent 
words with finite defect. 

Theorem 5.5. Let u G A^ be a uniformly recurrent word with finite defect. Then there exist 
a rich word v G £> N and a morphism ip : B* >->■ A* of class P ret such that 

u = tp(\). 

The word v is uniformly recurrent. 

Proof. Consider a prefix z of u of length \z\ > max{2i? u (K), H}, where K is the constant 
from Theorem 3.4 and H is an integer such that any factor of u of length > H has its longest 
palindromic suffix unioccurrent. (Let us recall that the existence of H is also guaranteed by 
Theorem 3.4.) Since the language of u is closed under reversal (this follows from the fact that u 
is uniformly recurrent and contains infinitely many palindromes), I is a factor of u as well and 
its lps(z) has a unique occurrence in ~z. As \z\ > 2R U (K) any factor shorter than or equal to K 
occurs in ~z at least twice. Therefore, \lps{^z)\ > K. Hence, lps(z) is a palindromic prefix of u of 
length greater than K. 

Denote p := lps(z). Since u is uniformly recurrent, the set of return words of p is finite, 
say qo, q±, ... , q m -\ is the list of all different return words. Let us define a morphism ip on the 
alphabet B = {0, 1, . . . , m — 1} by (p(b) = qb for all b G B. It is obvious that the morphism 
belongs to the class P re t- Then we can write u = q^q^q^ ... for some sequence (i n ) n eN G Z3 N . 
Let us put v = (i n ) n m- 
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We will show that any complete return word of any palindrome in the word v is a palindrome 
as well. According to Proposition 2.6 this implies the richness of v. 

Let s be a palindrome in v and x its complete return word. Then <p(x)p has precisely two 
occurrences of the factor <p(s)p. As s is a palindrome, <p(s)p is a palindrome as well of length 
— \p\ > K- Therefore (p{x)p is a complete return word of a long enough palindrome 
and according to our assumption f(x)p is a palindrome as well. This together with point (3) in 
Remark 5.2 implies 

tp(x)p = <p(x)p = (p(x)p. 
The point (2) then gives x = x as we claimed. 

The uniform recurrence of v is obvious. □ 

The reverse implication does not hold, i.e., the set of uniformly recurrent words with finite 
defect is not closed under morphisms of class P re t- Let us provide a construction of such a word. 

Example 5.6. Let vq = e. For i > set V{ = (vi^i0vi^ilvi^ilvi^i0vi^\2vi^i2)^ + \ where 
denotes the shortest palindrome having w as a prefix. 

Note that Vi-i is a prefix of Vi for all i. Thus we can set v = lim Vi and v is uniformly 

i— s-oo 

recurrent by construction. 

Denote by if a morphism from P re t defined by 



if : < 



i y 0100 

1 h-> 01011 

2 h-> 010111 



As we will show in the sequel, the word v is rich and the defect D((p(v)) = oo. 

Lemma 5.7. For all i the palindrome Vi from Example 5.6 is rich. 

Proof. We will show for all i that v% is rich and 

Vi = Vi-i0vi-ilvi-ilvi-i0vi-i2vi-i2vi-i0vi-ilvi-ilvi-i0vi-i. 

Furthermore, we will show that for all letters x, the word ViXVi contains exactly 2 occurrences 
of Vi and 1 occurrence of 0t>i_ixui_i0. 

We will proceed by induction on i. For i = 1 and 2 it is left up to the reader to verify the 
proposition. 

Suppose the fact holds for i, i > 2. We will show the claim for i + 1. Denote by w the factor 

w := Vi0vilvilvi0vi2vi2. 

Note that since ViXVi contains exactly 2 occurrences of Vi for all letters x, the factor w contains 
exactly 6 occurrences of Vi. In other words, if we find 1 occurrence of Vi, we know all the other 
occurrences. 

In Table 1 we can see the total number of palindromic factors of w. Let us give a brief 
explanation for rows which may not be clear at first sight. Let us recall that by the induction 
assumption 

Vi = Vi-i0vi-ilVi-ilVi-i0vi-i2vi-i2vi-i0vi-ilvi-ilvi-i0vi-i. 

Since there are exactly 11 occurrences of vi-\ in Vi, one can easily see that factors in rows 2, 3, 
and 4 have not been counted in row 1. Rows 5, 6, and 7 exploit the fact that for all letters x 
ViXVi contains 1 occurrence of Of j_ixvj_i0. One can see that the total number of palindromic 
factors is 6\vi\ + 7 = \w\ + 1, therefore w is rich from the definition. 
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# 




palindromic factors of w 


count 


1 




palindromic factors of V{ 


\Vi\ + 1 


2 




fin- , n 




K-i| + 1 


3 


lVi- 


-lOwi-il, ... 


,Wj_ilwj_iOt;j_ilt;j_i 


i iii 

+ 1 




-l<JVl-l i-Vl-1 i-V x- 


-i0ui_i2, ... 


, Wj_ 1 2«j_ i Owj_ i lvi- 1 i Owj_i 2t>j_i 


Ui. , 1 i i 
\ v i-l \ ~r 1 


5 


Ovi- 


-lOui-iO, ... 




- \Vi-l\ 


6 


Ovi- 


-il«i-iO, ... 




\Vi\ - \Vi-l\ 


7 




-i2«i_i0, ... 


, Vi2vi 


\Vi\ - \Vi-l\ 


8 




lUjl, . . . 


,ViOvilvilviOvi 


2\vi\ + 2 


9 




2v { 2 




1 


total 








6|«i| + 7 



Table 1. Enumeration of palindromic factors of w. 



As the right palindromic closure preserves the richness, we can see that Vi+i is rich. Moreover, 
since there are exactly 2 occurrences of v% in ViXVi for all letters x, one can see that the closure 
will produce the following palindrome 

Uj+i = Vi0vilvilvi0vi2vi2vi0vilvilvi0vi. 

Take a letter x. We can now rewrite Vi + \xvi + \ in terms of Vi and see the factor Ovixv jO occurs 
once and fj+i occurs twice again arguing by the known count of factors v j. 

□ 

Proposition 5.8. T/ie infinite word v defined in Example 5.6 is rich and D(<p(y)) = oo, where 
ip is also defined in Example 5.6. 

Proof. Directly from the definition of v, one can see using the previous lemma that all its prefixes 
Vi are rich and therefore v is rich. 

Denote by p the palindrome from the definition of P re t for the substitution ip. One can see 
that p = 010. Take lfjl, a factor of v. We have <^(lujl) = 01011(^(^)^11, a factor of </?(v). 
Using point (3) of Remark 5.2, we can see that Oi := \(p{vi)p\ is a palindrome. Now take 2vi2. 
One can see that p(2vi2) = 010111<p(vi)plll. Note again the palindromic factor Oj. 

We will now look for complete return words of Oj in ip(ri), where 

Ti = Ivilvi0vi2vi2. 

The word is clearly a factor of Vi + ±, therefore a factor of v. The first occurrence of Oj is 
produced by the factor lvil in r^. Since ip is injective, we need to look only at occurrences 
of Vi in rj. The next two occurrences are in the factors lvjO and 0t>j2. One can see that 
(^(liijO) = 01011</?(/Uj)p0 and ip(0vil) = 0W0p(vi)pll, i.e., the factor Oj does not occur in <p(ri) 
until the factor <p(2vi2) occurs. The complete return word of is then Oi := I(p(vilvi0vi2vi)pl. 
By point of Remark 5.2, as Ujlt>j0t>j2t>j is not a palindrome, neither is the complete return 
word Oi. Therefore for each i we have an oddity {Oi,Oi}. According to Proposition 3.3, it 
implies the defect of (p(v) is infinite. 

□ 

The last proposition concludes the counterexample 5.6. 

Remark 5.9. It is clear that the defect of an image by a morphism of class P re t of a word with 
finite defect depends on the morphism. As Example 5.6 shows, it depends also on the original 
word. To underline this fact we can take the morphism ip from Example 5. 6 and u the Tribonacci 
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word, i.e., the fixed point of the Tribonacci morphism i— > 01, 1 *— > 02 and 2 i— > - a well-known 
rich word [10]. if is easy t° see that D(ip(u)) = 0. 

6. Comments 

At the end of the article [4], the authors state several open questions, among them the following 
one: "Let u be a fixed point of a primitive morphism. If the defect is finite and non-zero, is the 
word u necessarily periodic?" 

We are not able to answer this question. The following observation is just a small comment 
to it. 

Observation 6.1. Let u be a fixed point of a primitive morphism and let its defect D(u) be 
finite. Then there exists a rich word v and a morphism ip G P ret such that u = </?(v) and v itself 
is a fixed point of a primitive morphism as well. 

Proof. The rich word v, which we have constructed in the proof of Theorem 5.5, is a derived 
word, as introduced by Durand in [11]. Lemma 19 of [11] says that any derived word of a fixed 
point of a primitive morphism is a fixed point of a primitive morphism as well. 

□ 

Theorem 5.5 has the form of implication, which cannot be reversed, since Example 5.6 demon- 
strates that a morphism from P ret does not preserve automatically the set of words with finite 
defect. It is thus natural to ask: 

Question 1: Is it possible to replace the class P re t with a smaller one in such a way that 
Theorem 5.5 can be stated in the form of equivalence? 

Question 2: Which morphisms from P ret do preserve the set of rich words? 

Question 3: How to compute D((p(u)) for a rich word u and a morphism from <p £ P re p. 

Question 4: Is it feasible to characterize morphisms ip on B* with the property that <p(u) 
has finite defect for any infinite word u£B R with finite defect? 
Let us comment Question 1. The authors of [12] define another class of morphisms that play an 
important role in the study of finite defect. They call a morphism <p on A* a standard morphism 
of class P (or a standard P-morphism) if there exists a palindrome r (possibly empty) such 
that, for all x £ A, <p(x) = rq x , where the q x are palindromes. If r is non-empty, then some (or 
all) of the palindromes q x may be empty or may even take the form q x = w~ l with ir x a proper 
palindromic suffix of r. They say that a standard P-morphism is special if: 

(1) all ip(x) = rq x end with different letters, and 

(2) whenever <p(x)r = rq x r, with x G A, occurs in some ip(y±y2 ■ ■ ■ yn)r, then this occurrence 
is (p(y m )r for some m with 1 < m < n. 

They prove the following theorem. 

Theorem 6.2 (Theorem 6.28 [12]). If ip is a standard special P-morphism on A* and u € A*, 
then D(u) < oo if and only if D((p(u)) < oo. 

However, as shown in the following proposition, standard special P-morphisms are not the 
only ones that preserve the set of uniformly recurrent words with finite defect, thus the class of 
standard special morphisms is too small as an answer to Question 1. Let us add that standard 
special morphisms of class P do not form a subset of morphisms of class P re t- For instance, 
if : a — > aabbaabba, b — > ab is a standard special P-morphism with r = a, but does not belong 

to P re t- 

Proposition 6.3. Let u be a binary uniformly recurrent word such that D(u) is finite. Let ip 
be a morphism of class P re t- Then D (</?(u)) is finite. 
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Lemma 6.4. Let ip be a morphism of class P re t on {0, 1}*. Then ip is conjugate to a standard 
special P -morphism. 

Proof. Let p be the palindrome corresponding to ip in the definition of P re t- Denote by p\ the 
longest common suffix of <p(0) and <p(l) ■ Denote by p2 a word such that pp2 is the longest common 
prefix of <p(0)p and (p(l)p. Using properties of P re t we have p\ = p~2~. Define cr(0) = pi^^p^ 1 
and a(l) = pi<^(l)pj~ 1 . Then <p is conjugate to a and a is a standard special P-morphism with 
the corresponding palindrome r = p\pp\. □ 



Proof of Proposition 6.3. By Lemma 6.4 the morphism (p is conjugate to a standard special P- 
morphism a. Clearly, the languages of <p(u) and <t(u) are the same, hence D (</?(u)) = D (<r(u)). 
Theorem 6.2 implies that D(a(u)) < oo. □ 
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